Pretest estimation in combining probability and non-probability samples

نویسندگان

چکیده

Multiple heterogeneous data sources are becoming increasingly available for statistical analyses in the era of big data. As an important example finite-population inference, we develop a unified framework test-and-pool approach to general parameter estimation by combining gold-standard probability and non-probability samples. We focus on case when study variable is observed both datasets estimating target parameters, each contains other auxiliary variables. Utilizing design, conduct pretest procedure determine comparability with decide whether or not leverage pooled analysis. When comparable, our combines efficient estimation. Otherwise, retain only also characterize asymptotic distribution proposed estimator under local alternative provide data-adaptive select critical tuning parameters that smallest mean square error estimator. Lastly, deal non-regularity estimator, construct robust confidence interval has good finite-sample coverage property.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pretest probability assessment derived from attribute matching

BACKGROUND Pretest probability (PTP) assessment plays a central role in diagnosis. This report compares a novel attribute-matching method to generate a PTP for acute coronary syndrome (ACS). We compare the new method with a validated logistic regression equation (LRE). METHODS Eight clinical variables (attributes) were chosen by classification and regression tree analysis of a prospectively c...

متن کامل

Non-zero probability of nearest neighbor searching

Nearest Neighbor (NN) searching is a challenging problem in data management and has been widely studied in data mining, pattern recognition and computational geometry. The goal of NN searching is efficiently reporting the nearest data to a given object as a query. In most of the studies both the data and query are assumed to be precise, however, due to the real applications of NN searching, suc...

متن کامل

One-Class Classification by Combining Density and Class Probability Estimation

One-class classification has important applications such as outlier and novelty detection. It is commonly tackled using either density estimation techniques or by adapting a standard classification algorithm to the problem of carving out a decision boundary that describes the location of the target data. In this paper we present a simple method for one-class classification that combines the app...

متن کامل

Combining Probability Forecasts

Linear pooling is by the far the most popular method for combining probability forecasts. However, any nontrivial weighted average of two or more distinct, calibrated probability forecasts is necessarily uncalibrated and lacks sharpness. In view of this, linear pooling requires recalibration, even in the ideal case in which the individual forecasts are calibrated. Toward this end, we propose a ...

متن کامل

Acceptable random variables in non-commutative probability spaces

Acceptable random variables are defined in noncommutative (quantum) probability spaces and some of probability inequalities for these classes  are obtained. These results are a generalization of negatively orthant dependent random variables in probability theory. Furthermore, the obtained results can be used for random matrices.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronic Journal of Statistics

سال: 2023

ISSN: ['1935-7524']

DOI: https://doi.org/10.1214/23-ejs2137